A new class of ranking functions for DCG-like evaluation metrics using conditional probability models

نویسندگان

  • Eunho Yang
  • Pradeep Ravikumar
  • Matthew Lease
چکیده

In the context of learning to rank for information retrieval [15], we study a general class of “DCG-like” ranking loss functions which include DCG [13] and approximate ERR [6] as specific cases. We then study the Bayes optimal ranking function for this class, which is a function of the conditional distribution of graded document relevance levels. Our main contribution is a novel class of ranking functions building upon the Bayes optimality result. Specifically, our ranking function class uses the conditional expectation of a function of the document features with respect to an ostensible multiclass logistic regression model. We empirically evaluate our proposals with respect to NDCG on seven standard learning to rank datasets [16] and three different surrogate evaluation metrics. Results show our proposed ranking functions improve accuracy across datasets and surrogate evaluation metrics, achieving 10.6% improvement on average and as high as 25% in the best case.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Counterfactual Learning-to-Rank for Additive Metrics and Deep Models

Implicit feedback (e.g., clicks, dwell times) is an attractive source of training data for Learning-to-Rank, but it inevitably suffers from biases such as position bias. It was recently shown how counterfactual inference techniques can provide a rigorous approach for handling these biases, but existing methods are restricted to the special case of optimizing average rank for linear ranking func...

متن کامل

Learning the Gain Values and Discount Factors of DCG

Evaluation metrics are an essential part of a ranking system, and in the past many evaluation metrics have been proposed in information retrieval and Web search. Discounted Cumulated Gains (DCG) has emerged as one of the evaluation metrics widely adopted for evaluating the performance of ranking functions used in Web search. However, the two sets of parameters, gain values and discount factors,...

متن کامل

An application of Measurement error evaluation using latent class analysis

‎Latent class analysis (LCA) is a method of evaluating non sampling errors‎, ‎especially measurement error in categorical data‎. ‎Biemer (2011) introduced four latent class modeling approaches‎: ‎probability model parameterization‎, ‎log linear model‎, ‎modified path model‎, ‎and graphical model using path diagrams‎. ‎These models are interchangeable‎. ‎Latent class probability models express l...

متن کامل

A New Method for Ranking Distribution Companies with Several Scenarios Data by Using DEA/MADM

In Data Envelopment Analysis, uncertain data are the inseparable part of real models. Natural models usually deal with uncertain and probable data. Many researchers prioritize these kinds of data. For instance, they study fuzzy data, interval data, probabilistic models etc. In this article, we proposed a method in which the decision making units are uncertain in their inputs and outputs. In the...

متن کامل

Evaluation of IR Applications with Constrained Real Estate

Traditional IR applications assume that there is always enough space (“real estate”) available to display as many results as the system returns. Consequently, traditional evaluation metrics were typically designed to take a length cutoff k of the result list as a parameter. For example, one computes DCG@k, Prec@k, etc., based on the top-k results in the ranking list. However, there are importan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014